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Abstract 

Background: Softwood is the predominant form of land plant biomass in the Northern hemisphere, and is among 
the most recalcitrant biomass resources to bioprocess technologies. The white rot fungus, Phanerochaete carnosa, 
has been isolated almost exclusively from softwoods, while most other known white-rot species, including 
Phanerochaete chrysosporium, were mainly isolated from hardwoods. Accordingly, it is anticipated that P. carnosa 
encodes a distinct set of enzymes and proteins that promote softwood decomposition. To elucidate the genetic 
basis of softwood bioconversion by a white-rot fungus, the present study reports the P. carnosa genome sequence 
and its comparative analysis with the previously reported P. chrysosporium genome. 

Results: P. carnosa encodes a complete set of lignocellulose-active enzymes. Comparative genomic analysis 
revealed that P. carnosa is enriched with genes encoding manganese peroxidase, and that the most divergent 
glycoside hydrolase families were predicted to encode hemicellulases and glycoprotein degrading enzymes. Most 
remarkably, P. carnosa possesses one of the largest P450 contingents (266 P450s) among the sequenced and 
annotated wood-rotting basidiomycetes, nearly double that of P. chr/sosporium. Along with metabolic pathway 
modeling, comparative growth studies on model compounds and chemical analyses of decomposed wood 
components showed greater tolerance of P. carnosa to various substrates including coniferous heartwood. 

Conclusions: The P. carnosa genome is enriched with genes that encode P450 monooxygenases that can 
participate in extractives degradation, and manganese peroxidases involved in lignin degradation. The significant 
expansion of P450s in P. carnosa, along with differences in carbohydrate- and lignin-degrading enzymes, could be 
correlated to the utilization of heartwood and sapwood preparations from both coniferous and hardwood species. 
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Background 

The lignocellulose fraction of plant cell walls is the most 
abundant renewable carbon source on earth, and is a key 
resource for substituting petroleum in the production of 
energy, chemicals and materials. Among the various types 
of lignocellulosic biomasses, softwood is the predominant 
form of land plant biomass in the Northern hemisphere 
[1]. Softwood (coniferous) and hardwood (deciduous) fiber 
mainly differ in the structure and composition of hemicel- 
lulose and lignin components. For instance, glucuronoxy- 
lan comprises approximately 20-30% of the secondary cell 
wall polysaccharides in hardwood, while galactoglucoman- 
nan is the main hemicellulose in secondary cell walls of 
softwood. Moreover, softwoods contain mainly guaiacyl 
lignin, whereas hardwoods contain varying ratios of syrin- 
gyl and guaiacyl lignins [2]. The recalcitrance of softwood 
lignocellulose to bioprocess technologies has been attribu- 
ted to its higher lignin content, smaller pore size, and 
fewer hemicellulose-derived acetyl groups in comparison 
with hardwood [3]. 

Although relatively few organisms are known to effect- 
ively utilize all components of lignocellulose, white-rot 
basidiomycetes transform cellulose, hemicellulose and 
lignin, and so are regarded as essential contributors to 
global carbon cycling [4]. While most known white-rot 
species, including the model fungus Phanerochaete chry- 
sosporium [5,6], have been mainly isolated from hard- 
woods, the white-rot basidiomycete, Phanerochaete 
carnosa, has been isolated almost exclusively from soft- 
woods, including Abies balsamea (balsam fir), Abies con- 
color (white fir), and Pinus ponderosa (ponderosa pine) 
[7]. Previous studies have revealed differences in gene 
expression between P. carnosa and P. chrysosporium, 
suggesting that P. carnosa may possess an enzyme com- 
plement that is efficient for softwood bioconversion. For 
example, proteomic characterization of P. carnosa secre- 
tomes showed that P. carnosa produces a glycoside 
hydrolase (GH) family 2 mannanase, a multicopper oxi- 
dase, and glycopeptides that likely participate in carbo- 
hydrate and lignin degradation [8], which were not 
previously identified in proteomic analysis of P. chrysos- 
porium [6,9]. More recent transcriptomic analyses evalu- 
ated gene expression in P. carnosa grown on various 
softwood species (white spruce, lodgepole pine, and bal- 
sam fir), as well as a hardwood (sugar maple). Notably, 
transcripts predicted to encode lignin-degrading activ- 
ities (particularly manganese peroxidases) were more 
abundant than those predicted to encode carbohydrate- 
active enzymes [10], which is in contrast to earlier stud- 
ies of P. chrysosporium grown on wood that revealed 
comparatively high levels of transcripts encoding 
carbohydrate-active enzymes [11]. 

Key requirements for the biotransformation of particular 
biomass resources could be elucidated through comparative 



analysis of closely related lignocellulose-degrading fungi 
having different substrate preferences. For instance, 
genomic comparison of P. chrysosporium and the 
softwood-degrading, model brown-rot fungus, Postia 
placenta, revealed that brown-rot is characterized by the 
contraction of multiple gene families, including cellobio- 
hydrolases and cellulose-binding domains [12]. Moreover, 
comparative analysis of Aspergillus genomes identified cor- 
relations between genome content, plant polysaccharide 
degradation, and respective biotope [13]. Given the appar- 
ent differences in substrate preference of P. carnosa and 
P. chrysosporium, and their phylogenetic similarity based 
on internal transcribed spacer (ITS) region sequences [14], 
it is anticipated that comparative analysis of P. carnosa and 
P. chrysosporium genomes could reveal enzymes and meta- 
bolic pathways that are key to efficient biotransformation of 
recalcitrant softwood feedstocks. Accordingly, the present 
study reports the first analysis of the P. carnosa draft gen- 
ome, and compares P. carnosa and P. chrysosporium 
in terms of genome composition and organization, as 
well as growth on model and woody substrates. These 
analyses revealed significant expansion of P450 genes 
in P. carnosa compared to P. chrysosporium, and high- 
lighted several differences in carbohydrate active enzymes 
and lignin-degrading enzymes, which may facilitate soft- 
wood utilization by P. carnosa. 

Results and discussion 

Assembly and overview of the P. carnosa genome 

The net length of the P. carnosa genome sequence was 
46.3 Mb, and while distributed over 1137 scaffolds, the 
six largest scaffolds contained half of the total sequence 
and 57% of the 13,937 predicted genes (Additional file 1: 
Table SI, Table S2, Table S3). A summary of functional 
annotations by several classifications is shown in 
Additional file 1: Table S4. Of these genes, those pre- 
dicted to encode P450 monooxygenases, MFS transpor- 
ters, and signaling proteins (e.g. WD40 and protein 
kinases), comprised the largest families based on PFAM 
domains of translated sequences, and were expanded 
relative to other Basidiomycetes (Additional file 1: Table 
S5). Notably, the P. carnosa genome has 503 tandem 
gene duplication regions containing a total of 1660 
genes, significantly higher numbers than the P. chrysos- 
porium genome, which contains 305 duplication regions 
with 865 genes. Indeed, most of the top 50 PFAM 
domains are expanded in P. carnosa relative to P. chry- 
sosporium (Additional file 1: Table S5) and these are 
largely responsible for the differences in gene count and 
genome size between the two organisms. 

A synteny display based on orthologous proteins in 
P. carnosa and P. chrysosporium revealed significant 
rearrangement in the evolutionary history of these 
two Phanerochaete genomes (Figure 1). Most of the 
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Figure 1 Syntenic regions of the 10 largest scaffolds of P. carnosa with scaffolds of P. cbrysosporium. Scaffolds are shown as vertical bars, 
sequence starting from bottom to top, where the length of scaffolds is proportional to sequence length (largest: scaffold_1; smallest: scaffolds 9 
and 10). Colors on the scaffolds refer to positions of genes that are the best homologs (orthologs) to corresponding P. chrysosporium scaffolds of 
the same color. Distribution of genes encoding CAZymes (red bars) and FOLymes (blue bars) across the scaffolds are also shown to the left of 
each scaffold. 



carbohydrate- active enzymes [15] were distributed across 
the ten largest scaffolds of the P. carnosa genome se- 
quence. While the glycoside hydrolase (GH) families GH7 
and GH3 were enriched on scaffold_2, and the GH5 fam- 
ily was enriched on scaffold_4i most carbohydrate-active 
enzymes (CAZymes) were only loosely clustered. By con- 
trast, six of the seven predicted manganese peroxidases 
(MnPs) were located on scaffold_5, and three of the four 
predicted lignin peroxidases (LiPs) were located on scaf- 
fold_10 (Additional file 2: Table S6), indicating that cluster 
formation of MnPs and LiPs is much tighter than that 
of CAZymes. 

A primary aim of the current analysis was to compare 
the genomic content of P. carnosa and P. chrysosporium, 
and to correlate differences with their abilities to trans- 
form various compounds, including lignocellulosic sub- 
strates. Many previous reports demonstrate good 
correlation between genomic content and relative growth 
patterns on various carbon sources [13,16-18]. In those 
experiments, relative growth of a particular fungal species 
is determined by comparing the radius and density of the 
mycelia on a particular carbon source to that on D- 
glucose [19]. Relative growth patterns can then be com- 
pared between fungi, since differences in colony morph- 
ology and general differences in growth rates, are taken 
into account [16]. Accordingly, in addition to obtaining 



and analyzing the P. carnosa genome sequence, P. carnosa 
and P. chrysosporium were each grown in duplicate on 35 
model carbon sources to determine how well respective 
genome contents correlated to relative growth measure- 
ments. Similar to previous analyses, relative growth of 
each fungus was determined separately by comparing the 
radius and density of mycelia on a particular carbon 
source to that on D-glucose; relative growth profiles were 
then compared between the two species (Additional file 3: 
Figure SI) [19]. Notably, the relative growth oiP. carnosa 
on casein was higher than that observed for P. chrysospor- 
ium (Additional file 3: Figure SI), which is consistent with 
the considerable expansion of proteases encoded by 
P. carnosa (Additional file 1: Table S5). Similarly, where 
possible in the following sections, the growth profiles of 
P. carnosa and P. chrysosporium are correlated to relative 
occurrence of CAZymes and fungal oxidative lignin 
enzymes (FOLymes) encoded by these white-rot fungi. 

Carbohydrate active enzymes (CAZymes) and correlation 
to carbon catabolism 

Gene models encoding enzymes that were predicted to 
degrade, modify, or synthesize glycosidic bonds were 
annotated based on the CAZy classification scheme 
(Additional file 4: Table S7) [20-22]. The number of pre- 
dicted CAZymes encoded by P. carnosa, P. chrysosporium 
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and other Agaricomycotina genomes were also compared 
(Additional file 1: Table S8). As is usually found among 
wood decaying fungi, P. carnosa and P. chrysosporium en- 
code a large repertoire of GH61 members (11 and 13 
GH61s, respectively). Whereas the P. chrysosporium gen- 
ome encodes two GH61 proteins that are located together 
on scaffold_10 (Phchr41563 and Phchr41650, where num- 
bers indicate protein IDs from the JGI database), P. car- 
nosa encodes a single ortholog (Phaca263097), which was 
highly expressed during growth on different wood samples 
(Additional file 3: Figure S2). 

Although similar distributions of broad CAZy classifi- 
cations were observed, closer inspection of particular 
CAZy families revealed notable differences between 
P. carnosa and P. chrysoporium (Figure 2). For instance, 
both families GH23 and GH25 encode lysozyme activity, 
and the absence of GH25 in P. carnosa appeared to be 



compensated by a gain of a GH23 enzyme. Likewise, the 
absence of GT50 (a-l,4-mannosyltransferase) enzymes and 
CBMlSs in P. carnosa was balanced by a gain in genes that 
likely encode functionally similar proteins from families 
GT22 and CBM12. After family 1 CBMs, CBM13s com- 
prise the largest CBM family encoded by P. chrysosporium 
whereas P. carnosa totally lacks members from this CBM 
family. Although sugar binding by P. chrysosporium 
CBMlSs has not been characterized, xylooligosaccharides 
were shown to associate with a CBM13 binding domain 
from Streptomyces lividans [23]. Families GH5, GH79, 
CE16, and PL14 appear to have expanded in P. carnosa 
relative to P. chrysosporium (Figure 2), and contained a 
comparatively high fraction of proteins with relatively low 
identity (<60%) to orthologous proteins in P. chrysosporium 
(Figure 3). While families GH5 and CE16 include enzymes 
involved in hemicellulose degradation, GH79 includes 
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Figure 2 Distribution of CAZymes in P. carnosa (shadowed) and P. chrysosporium (lined). 
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Figure 3 The distribution of percent identities of CAZyme members between P. carnosa and P. chrysosporium. For each query sequence 
(P. carnosa genes), the highest percent identity in blastp was scored. The numbers of P. carnosa genes are indicated according to the percent 
identities. 



enzymes involved in processing glycoproteins, includ- 
ing those found in plant cell walls [24]. 

Given the expansion of GH5 members in P. carnosa 
and P. chrysosporium (24 and 19 GH5s, respectively), 
along with the role of GH5 enzymes in degrading (galacto) 
glucomannan present in softwood, and proficient colony 
development of P. carnosa on guar gum (galactomannan) 
compared to birchwood xylan and pectin (Additional 
file 3: Figure SI), a phylogenetic analysis of GH5 pro- 
tein sequences was performed to determine whether 
P. carnosa encodes new branches within this enzyme 
family (Additional file 3: Figure S3). This analysis identi- 
fied one ortholog of P. chrysosporium ^-mannanases 
(Phchr5115 and Phchrl40501; ManSC and ManSD, re- 
spectively) in P. carnosa (Phaca248589), and revealed that 
Phacal68586 clusters on a neighboring branch and with 
an orthologous protein most simOar to mannanases. Three 
additional GH5 sequences (Phaca253799, Phacal94783 and 
Phaca 194761) formed a branch in the protein phylogeny 
that lacked representation from P. chrysosporium. 
Since Phacal68586 shares comparatively low identity 
(56%) with the closest homolog in P. chrysosporium 
(Additional file 1: Table S9), and Phaca253799 was 
the only gene product from the unique phylogenetic 
branch that was upregulated in P. carnosa during 
growth on wood [10], Phacal68586 and Phaca253799 
might be particularly interesting to target for future 
biochemical characterization. 



Sugar transport and intracellular metabolism 

In addition to the secretion of extracellular glycoside 
hydrolases, the ability to metabolize wood polysacchar- 
ides depends on the transport of resulting sugars and 
oligosaccharides, as well as intracellular conversion of 
these compounds. 



Although genes encoding major facilitator superfamily 
(MFS) transporters appeared to have expanded in the 
P. carnosa genome, a phylogenetic analysis of predicted 
sugar transporters encoded by P. carnosa, P. chrysosporium, 
and other fungi formed eight sequence clusters, and 
revealed a similar distribution of P. carnosa and P. chrysos- 
porium proteins (Additional file 3: Figure S4). A previous 
transcriptomic analysis of P. carnosa revealed comparatively 
high expression of Phaca263731 on nutrient rich medium 
[10], while Phchrl37220, which also clusters in Group III, 
is upregulated in cellulose cultivations compared to glucose 
cultivations [25]. More interestingly, genes encoding four of 
the five Group VIII transport proteins were upregu- 
lated in P. carnosa grown on wood substrates (Additional 
file 3: Figure S4; [10]). This group comprises predicted cel- 
lobiose transporters previously found in yeasts and 
filamentous fungi [26-29]. Notably, a Group VIII cel- 
lodextrin transporter from N. crassa (NcNCU00801 
and 08114) has been biochemically characterized [30], 
supporting that P. carnosa transporter proteins found 
in this group could have a role in cellobiose and/or 
cellodextrin uptake. 

Sugar metabolism by P. carnosa and P. chrysosporium 
was further investigated by using corresponding genome 
sequences to reconstruct an entire metabolic network 
for each organism [31]. Gene products predicted to par- 
ticipate in central metabolism were then manually 
curated and expression levels were assigned from previ- 
ous transcriptomic analyses (Additional file 5: Table 
SIO). The results of this analysis revealed that the pen- 
tose catabolic pathway and the galacturonic acid path- 
way were induced in P. carnosa during growth on 
lignocellulosic substrates compared to nutrient medium, 
and that this response was accompanied by an increase 
in the expression of a transketolase, which catalyzes the 
conversion of D-xylulose-5-P to erythrose-4-phosphate 
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in the pentose phosphate pathway. Enzymes that catalyze 
the first step in rhamnose and galactose metabolism were 
also induced in P. carnosa wood cultivations [10]. Similar 
trends were observed in P. chrysosporium, although it has 
fewer gene products in the corresponding metabolic path- 
ways (Additional file 5: Table SIO). 

Notably, growth studies revealed that P. carnosa grew 
poorly on birchwood xylan and pectin (Additional file 3: 
Figure SI). Since the pentose and the galacturonic acid 
catabolic pathways likely promote the metabolism of 
pectin and xylan-derived sugars, metabolic models pre- 
dicted herein suggest that poor growth of P. carnosa on 
pectin and xylan substrates could result from low ex- 
pression of extracellular pectinase and xylanase enzymes 
or sugar transporters, rather than intracellular metabol- 
ism of hydrolysis products. 

Enzymes for lignin degradation, and conversion of related 
aromatic compounds 

The most evident difference in lignin-degrading activities 
encoded by P. carnosa and P. chrysosporium was the dis- 
tribution of predicted manganese peroxidases (MnP) and 
lignin peroxidases (LiP) (Additional file 1: Table Sll) [32]. 
The P. carnosa genome encodes eleven Class II peroxi- 
dases: seven manganese peroxidases (MnP: Phaca256980, 
144982, 256984, 256991, 256997, 94399, 262882) and four 
lignin peroxidases (LiP: Phaca212237, 263501, 213241, 
152156). By comparison, P. chrysosporium encodes five 
MnPs and ten LiPs (Additional file 1: Table Sll), suggest- 
ing that the two species might primarily rely on different 
peroxidase families for lignin decay. Growth profiling 
demonstrated that while both fungi grew well on wheat 
bran containing approx. 3% lignin, P. chrysosporium grew 
better than P. carnosa on cotton seed pulp (approx. 30% 
lignin) (Additional file 3: Figure SI), suggesting a higher 
ligninolytic ability for P. chrysosporium. This result is con- 
sistent with the expansion of MnPs and reduction of LiPs 
in P. carnosa compared to P. chrysosporium, and the typ- 
ically higher redox potential of LiP oxidoreductases. 

A phylogenetic tree was previously constructed to infer 
the relationship of Class II peroxidases from P. carnosa and 
P. chrysosporium, and tree topology indicated gene expan- 
sion after P. chrysosporium and P. carnosa speciation [10]. 
Closer analysis of the protein phylogeny suggested that a 
common ancestor of P. carnosa and P. chrysosporium likely 
encoded three mnp genes and at least three lip genes. An 
analysis of intron distribution revealed the existence of up 
to seven conserved intron positions in the coding regions 
of P. carnosa and P. chrysosporium mnp genes, and up to 
ten conserved intron positions in the coding regions of the 
lips (Additional file 3: Figure S5). Most of these genes have 
lost one or two introns, suggesting a role for retrotransposi- 
tion in gene expansion. 



Among the nine predicted multi-copper oxidases 
(MCO) was a conventional ferroxidase (fet3; Phacal41262), 
and seven genes with high similarity (> 65% amino acid 
identity) to P. chrysosporium mco4 (Phchrl0581): these 
included Phaca261553, Phaca261563, Phacal49761, 
Phacal00787, Phacal00639, Phaca 149824, Phacal86926, 
which were all located on scaffold_8. Two additional genes, 
Phaca261609 and Phaca60261, were more closely related 
to P. chrysosporium mco3 and mco2, respectively. Notably, 
only two of these MCO-encoding genes, Phacal41262 
{fet3) and PhacalOG639 {mco4), appear to be upregulated 
during growth on wood [10]. The predicted cellobiose de- 
hydrogenase {cdh: Phaca259608) and an ortholog to cirl 
(cellulose-binding iron reductase, Phacal61126) were also 
up-regulated in P. carnosa during growth on wood [10]. As 
in the case of P. chrysosporium, P. carnosa does not appear 
to encode laccases sensu stricto. However, recent work with 
a Phanerochaete flavido-alba MCO shows that some 
members of the ferroxidase/laccase group may in fact have 
laccase activity [33] . 

Various enzymes are proposed to supply the H2O2 
required for oxidase activity, the best established of 
which is glyoxal oxidase (GLOX, FOLy LDA3). The 
P. carnosa genome contains one ^/ox gene (Phaca258261) 
and five related copper radical oxidases {cro: Phacal23913, 
Phaca259359, Phacal43144, Phaca263533, Phaca263528), 
compared to one glox and six related cro in P. chry- 
sosporium (Additional file 1: Table S12). Phacal23913, 
Phaca259359, and Phacal43144 were most closely 
related to crol, cro2, and cro6 of P. chrysosporium 
(Phchr259359, Phchrl23913, Phchr258261), respectively. 
Both Phaca263533 and Phaca263528 were similar to cro3, 
cro4, and croS, and contain repeated WSC domains of un- 
known function. These two WSC-containing cros are 
found within the lip physical gene cluster on scaffold_10. 
Notably, of the five copper radical oxidases identified here, 
only Phaca263528 was upregulated in P. carnosa during 
growth on wood [10]. 

Cytochrome P450 monooxygenases 

Fungi encode a large repertoire of P450 genes [5,12,34], 
and in wood-rotting basidiomycetes in particular, P450s 
are known to be involved in the oxidation of phenolic 
and non-phenolic aromatic compounds with substruc- 
tures similar to those occurring in lignin and extractive 
compounds that otherwise inhibit microbial growth [35- 
38]. Accordingly, the distinct lignin composition of hard- 
woods and softwoods, as well as the variety of extrac- 
tives produced by different wood species, provoked an 
interest to compare P. carnosa and P. chrysosporium in 
terms of the P450s they encode. 

The current genomic analysis revealed that P. carnosa 
possesses one of the largest P450 contingents (266 
P450s) among the sequenced and annotated wood- 
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rotting basidiomycetes, much higher than that of P. chry- 
sosporium (149 P450s) [34], somewhat larger than that of 
the brown rot fungus P. placenta (236 P450s) [12], and 
higher than any of the 31 fungal genomes recently 
reported by Floudas et al. [18]. Among the large number 
of tandem gene duplication events in the P. carnosa gen- 
ome, the largest tandemly duplicated gene family is P450 
with 38 duplication regions resulting in 80 genes, while 
corresponding numbers in the P. chrysosporium genome 
are 15 and 23 (Additional file 1: Table S13). Roughly 33% 
of the P450 genes encoded by P. carnosa appear to 
be tandemly duplicated. According to the phylogen- 
etic analysis, tandem duplication of at least 15 P450 
genes occurred after divergence from P. chrysosporium. 
For example, four tandemly duplicated genes found in 
scaffold 7 (1419983-1433634) were 80-90 identical, 
and shared roughly 60% identity with the best homo- 
log in P. chrysosporium. 

The 266 P450s of P. carnosa were grouped into 10 
known P450 clans, 36 families and 77 subfamilies, with 



CYP64 being the largest clan with 124 members (Figure 4, 
Additional file 1: Table S14). In comparison to P. chrysos- 
porium, several P450 clans (CYP52, CYP53, CYP54, 
CYP64, CYP67, CYP534 and CYP547) and many families 
had expanded in P. carnosa (Figure 5 A and B). Interest- 
ingly, CYP64 and CYP52 were nearly doubled in P. car- 
nosa and genes from these clans represented over half of 
the 28 unique P450 gene transcripts that were upregulated 
in P. carnosa wood degrading cultures ([10], Additional 
file 1: Table S15). 

Transformation of softwood and hardwood components 
by P. carnosa and P. chrysosporium 

The genomic analysis of P. carnosa revealed that among 
the most remarkable differences between this organism 
and P. chrysosporium was the apparent enrichment in 
P450 genes and the difference in distribution of LiP and 
MnP genes. P. carnosa has been mainly isolated from 
softwood wood species, and so an interesting prospect is 
that growth of this organism on softwood is facilitated 
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Figure 4 Phylogenetic tree of P. carnosa cytochrome P450 proteins (P450ome). Phylogenetic analyses were conducted in MEGA4 using the 
UPGMA metlnod [75]. A total of 210 P450s {> 330 amino acids length) were grouped under ten existing clans and one unassigned clan (UA), with 
clans shown in different colors. P450 members that were upregulated in wood degrading cultures [10] are indicated with asterisks and are listed 
in Additional file 1: Table SI 5. 
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Figure 5 Comparative evolutionary analysis of the P450omes of P. carnosa and P. chrysosporium. The two P450omes were compared at 
the clan (A) and family (B) levels. In the P. carnosa P450ome (266 P450s) (green bars), 10 of the 1 1 clans and 32 of the 36 families corresponded 
to those present in P. chrysosporium (149 P450s) (red bars). Numbers on the P. carnosa clan or family bars represent the membership expansion 
(+) or reduction (-) in that clan or family as compared to P. clirysosporium. 



by the oxidative transformation of guaiacyl lignin and 
otherwise inhibitory wood extractives. To test this possi- 
bility, P. carnosa and P. chrysosporium were grown on 
heartwood and sapwood preparations from various hard- 
woods (sugar maple, yellow birch, trembling aspen) and 
softwoods (red spruce, white spruce, balsam fir, red 
pine), and residual wood extractives and lignin were 
measured using Ultra High Performance Liquid Chro- 
matography (UPLC) and Fourier transform infrared 
spectroscopy (FT-IR), respectively. 

UPLC analyses revealed that while P. carnosa and 
P. chrysosporium similarly reduced the total phenolic 
content of sapwood samples, P. carnosa transformed 
a higher fraction of phenolics in most of the heart- 
wood samples, including those from softwood species 
containing high initial phenolic content (Figure 6). 
Interestingly, P. chrysosporium transformed a broader 
range of heartwood phenolics in maple than P. carnosa, 
and this trend was also observed albeit to a lower ex- 
tent, for aspen heartwood (Additional file 1: Table S16). 



Moreover, growth of P. chrysosporium on heartwood from 
softwood species was only observed in red spruce cultiva- 
tions (Additional file 3: Figure S6), which contained 
relatively low initial phenolic content. In some cases, 
the transformation of specific phenolic compounds by 
P. carnosa could also be correlated to growth. In particu- 
lar, the transformation of kaempferol 3-O-p-D-glucoside 
and naringin in heartwood from white spruce was higher 
for P. carnosa than P. chrysosporium (Additional file 1: 
Table SI 6), and P. carnosa exhibited better growth on 
white spruce (Additional file 3: Figure S6). Both kaempferol 
3-O-p-D-glucoside and naringin are sugar-containing 
phenolic compounds, where the sugar moiety can be 
released by GHl P-glucosidase activity and GH13, 
GH15 or GH78 a-rhamnosidase activity, respectively. 
Since GH families 1, 13, 15 and 78 did not substan- 
tially differ between P. carnosa and P. chrysosporium, 
detoxification of the phenolic moiety of kaempferol 
3-O-P-D-glucoside and naringin might have pro- 
moted the utilization of these compounds. 
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Figure 6 Concentration of phenolic compounds in whole extractives of wood samples. P. carnosa (lines) and P. chiysosporium (white) were 
grown on heartwood (A) and sapwood (B) wood samples at 27°C P. carnosa grew significantly slower than P. chrysosporium, and so samples 
were taken for analysis after 42 and 18 days of cultivation for P. carnosa and P. chrysosporium, respectively (for colony diameter, see Additional 
file 3: Figure S6). Grey bars indicate the phenolic content of wood samples prior to fungal cultivation. The amount of phenolics in total extractives 
was measured using the F-C reagent method and calculated as percent concentration [77], Gallic acid was used to generate a calibration curve. 
Error bars show the standard deviation in biological triplicates. 



Aspen and pine wood samples were also analyzed by 
FT-IR to identify differences in lignin and carbohydrate 
components resulting from P. carnosa or P. chrysospor- 
ium cultivation. Principal component analysis (PCA) of 
wood FT-IR spectra in the fingerprint region (800- 
1800 cm' ) clearly separated wood samples decayed by 
P. carnosa and P. chrysosporium (Additional fUe 3: Figure 
S7). Overall, wood samples collected from P. carnosa cul- 
tivations were distinguished from control samples by a re- 
duction in lignin (Additional file 3: Figure S7; Additional 
file 1: Table S17); wood samples from P. chrysosporium 



cultivations were further distinguished by more extensive 
polysaccharide degradation (Additional file 3: Figure S7; 
Additional file 1: Table S17). Notably, peak intensities at 
wavenumbers characteristic of cellulose absorption were 
reduced in heartwood pine cultivations, even though 
growth of P. chrysosporium was not visible on this wood 
sample (Additional file 3: Figure S6). The decay patterns 
revealed by the FT-IR analysis suggest that while P. chry- 
sosporium is known to elicit simultaneous decay of lignin 
and polysaccharides, P. carnosa likely elicits sequential 
degradation of cell wall components. These results are 
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consistent with previous reports that illustrate compara- 
tively high expression of genes encoding lignin-degrading 
activities in P. carnosa, and cellulose-degrading activities 
in P. chrysosporium, during growth of these fungi on vari- 
ous wood samples [10,11,39]. 

Conclusions 

An underlying hypothesis of the current study was that 
identifying subtle differences in the distribution and se- 
quence of genes encoded by P. carnosa and P. chrysospor- 
ium would facilitate the discovery of candidate enzymes 
that promote the conversion of recalcitrant softwood feed- 
stocks. Indeed, the comparative genomic analysis of these 
related white-rot fungi revealed a tangible set of enzyme 
classes that could promote the conversion of softwood. In 
particular, this study revealed that the most divergent 
CAZymes belonged to families GH5, GH79, CE16, and 
PL14, that the relative abundance of genes encoding MnP 
and LiP oxidoreductases was reversed in P. carnosa com- 
pared to P. chrysosporium, and that P. carnosa encodes 
the largest contingent of P450 enzymes than any other ba- 
sidiomycete characterized to date. 

Specific hemicellulases, particularly from family GH5, 
were identified for future biochemical characterization. 
However, the clearest correlations between growth studies 
and differences in encoded enzymes were in regards to lig- 
nin removal and ability to transform wood extractives. 
While the predicted differences in the redox potential of 
MnP and LiP was consistent with slower growth of P. car- 
nosa on wheat bran and cotton seed pulp compared to 
P. chrysosporium, oxidants produced by MnP might 
diffuse more easily through dense lignocellulose struc- 
tures than LiP associated oxidants [40]. P. carnosa 
removed a higher proportion of phenolic extractives 
in heartwood samples of softwood than did P. chrysos- 
porium; P. carnosa also showed better growth on these 
feedstocks. Given the comparatively high expression of 
P450 genes from the CYP64 and CYP52 clans in P. car- 
nosa grown on woody substrates [10], an intriguing 
possibility is that the expansion of CYP64 and CYP52 
families discovered through genome sequencing enables 
P. carnosa to colonize heartwood from both softwood 
and hardwood species. The implication of this ana- 
lysis is that detoxification of wood extractives and 
ability to degrade dense lignin structures may be key to 
enhancing softwood bioconversion. 

Methods 

Data access 

P. carnosa genome assembly and annotations are avail- 
able via JGI Genome Portal MycoCosm [41] and DDBJ/ 
EMBL/GenBank under the accession AEHBOOOOOOOO. 



Fungal strains 

Phanerochaete carnosa strain HHB-10118-sp and Pha- 
nerochaete chrysosporium strain RP-78 were obtained 
from the U.S. Department of Agriculture (USDA) Forest 
Products Laboratory (Madison, WI). These strains were 
maintained on YMPG agar plates under stationary con- 
ditions at 27°C. YMPG medium contained 2 g yeast ex- 
tract, 10 g malt extract, 2 g peptone, 10 g glucose, 2 g 
KH2PO4, 1 g MgS04-7H20, and 15 g agar per 1 L in 
H2O. 

Confirming Homol<aryosis 

Single-copy cdh and fet3/ftrl genes of P. carnosa were 
amplified by PGR and sequenced to reveal an absence of 
allelic polymorphism [42]. Specifically, PGR was per- 
formed using AccuPrime Pfx DNA polymerase and Re- 
action Mix (Invitrogen) with 160 ng of genomic DNA 
and 12.5 pmol of each primer in a 25 |iL reaction vol- 
ume. Primers for amplification of cdh (5'-TCKGARGC 
HGGVAAGAARGT-3' and 5'-GGVCCRATVCCGCTYT 
GGAA-3') were designed based on conserved sequences 
from six Basidiomycete fungi, and the PGR cycle was 
run as follows: 95°G for 9 min, 30 cycles of (95°C for 
1 min, 50°C for 2 min, 72°G for 2 min), and 72°C for 
15 min. Primers for amplification of fetSlftrl (5'- 
TGGAGGATCTGGAAGTTGTG-3' and 5'-TGTGACG 
GAAGAGGATGAAG-3') were based on the correspond- 
ing P. chrysosporium sequence, and the PGR cycle was 
run as follows: 95°G for 9 min, 30 cycles of (95°G for 1 min, 
65°C for 2 min, 72°C for 3 min), and 72°C for 15 min. 
Amplified sequences were cloned into the pGR2.1-TOPO 
plasmid (Invitrogen) and sequenced at the Analytical 
Genetics Technology Gentre (Toronto, ON, Ganada) or 
the Gentre for Applied Genomics (Toronto, ON, Ganada). 
Absence of clamp connections normally produced by het- 
erokaryons during cell division [43] was also confirmed by 
microscopic visualization of P. carnosa mycelia. 

Genome sequencing and assembly 

The genome was sequenced using a combination of San- 
ger (4 kb, 8 kb, 40 kb paired end), 454 (Titanium un- 
paired; 3.7 kb, 5.2 kb, 6.1 kb, 14.5 kb Titanium paired 
end), and Illumina (3 lanes of 2x76 bp, 0.3 kb insert 
paired end) sequencing platforms (Additional file 1: 
Table Sla). All general aspects of library construction 
and sequencing can be found at the JGI website [44]. 
The Illumina data was assembled with the Velvet assem- 
bler (version 0.7.55; [45]) with a hash length of 61 and 
the following options; -ins_length 250 -scaffolding no 
-exp cov 16 -cov_cutoff 5, to produce an assembly with 
a final graph with 51765 nodes, n50 of 2413, max 22432, 
total 37740533, using 73178660/94020728 reads. Gontigs 
greater than or equal to 800 bp in length were shredded 
into 1000 bp fragments with 800 bp overlap to be used 
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by Newbler assembler. After eliminating possible con- 
taminant data, the combined set of velvet fragments, 
454 and Sanger reads was assembled with the Newbler 
assembler, version 2.5-internal-10Apr08-l with the fol- 
lowing options; -fe reads2remove.FQC -consed -nrm 
-finish -info -rip -sio -g -ml 31 -mi 98 -e 48, to a final 
estimated assembled coverage of 58X, 1137 scaffolds 
with an N/L50 of 6/3.6 Mb, and 2687 contigs with an L/ 
N50 of 248/45.2 Kb (Additional file 1: Table Sib). One 
round of automated gap closure using our in house 
gapResolution tool [46] resulted in a final assembly with 
2272 contigs with an N/L50 of 139/74.8 kb. Newbler 
assembled consensus EST sequence data was used to as- 
sess the completeness of the final assembly using align- 
ment with 90% identity and 85% coverage thresholds. This 
resulted in 90.33% placement. 

cDNA library construction, sequencing and assembly 

Two clone cDNA libraries were constructed using RNA 
from P. carnosa mycelium grown in liquid YMPG 
medium and sequenced as described previously [47], 
with the minor difference that there were the two size 
ranges of cDNA (0.6 k-2 kb and >2 kb). The smaller 
cDNA insert library resulted in 9,971 ESTs for further 
processing and the larger insert library resulted in 9,530 
ESTs. RNA purified from mycelia grown on mixed soft- 
wood was used to construct a 454 cDNA library accord- 
ing to the cDNA Rapid Library Preparation Method 
Manual (Rosch, Germany), yielding 1,139,862 ESTs for 
clustering and assembly. The entire set of 1,159,363 
reads were assembled using Newbler (v2.3-PreRelease-6/ 
30/2009) with default parameters resulting in 16,234 
contigs greater than 50 bp. and 59,361 singlets. 

Genome annotation 

The P. carnosa genome was annotated using the JGI an- 
notation pipeline, which takes multiple inputs (scaffolds, 
ESTs, and known genes) and runs several analytical tools 
for gene prediction and annotation. Results were depos- 
ited in the JGI Genome Portal [48], a part of the inte- 
grated fungal resource MycoCosm [42] for further 
analysis and manual curation. 

Genomic assembly scaffolds were masked using 
RepeatMasker [49] and the RepBase library of 234 fungal 
repeats [50]. tRNAs were predicted using tRNAscan-SE 
[51]. Using the repeat-masked assembly, several gene 
prediction programs falling into three general categories 
were used: 1) ab initio - FGENESH [52]; GeneMark [53], 
2) homology-based - FGENESH+; Genewise [54] seeded 
by BLASTx alignments against GenBank's database of 
non-redundant proteins (NR: [55]), and 3) EST-based - 
EST_map [56] seeded by EST contigs. Genewise models 
were extended where possible using scaffold data to find 
start and stop codons. EST BLAST alignments [57] were 



used to extend, verify, and complete the predicted gene 
models. The resulting set of models was then filtered for 
the best models, based on EST and homology support, 
to produce a non-redundant representative set of 13,937 
gene models with characteristics described in Additional 
file 1: Table S2. This representative set was subject to 
further analysis and manual curation. 

Predicted gene models were functionally annotated 
using SignalP [58], TMHMM [59], InterProScan [60], 
and BLASTp [61] against the National Center for Bio- 
technology Information nr database, and hardware- 
accelerated double-affine Smith- Waterman alignments 
[62] against [63] KEGG [64] and KOG [65]. KEGG hits 
were used to assign EC numbers [66], and Interpro and 
SwissProt hits were used to map GO terms [67]. Func- 
tional annotations of the representative set of genes are 
summarized in Additional file 1: Table S3 and Table S4. 
Multigene families were predicted with the Markov clus- 
tering algorithm (MCL [68]) using BLASTp alignment 
scores between proteins as a similarity metric. 

Carbohydrate active enzymes 

All P. carnosa protein models were subjected to a 
procedure combining BLAST and HMMer3 searches 
against sequence libraries and HMM profiles derived 
from the families of glycoside hydrolases, polysac- 
charide lyases, carbohydrate esterases, glycosyltrans- 
ferases and carbohydrate-binding modules featured in 
the CAZy database [15,69]. The models correspond- 
ing to glycoside hydrolase families GHl, GH2, GH3, 
GH5, GH6, GH7, GH9, GHIO, GHll, GH12, GH16, 
GH28, GH31, GH43, GH45, GH51, GH53, GH55, 
GH61, GH74, GH79, GH115, carbohydrate esterase 
families CEl, CE15, CE16, carbohydrate-binding module 
family CBMl, as well as cellobiose dehydrogenase and 
cellulose-binding cytochrome ^552 [70] were manually 
checked using TBLASTN against the P. carnosa assembly 
database [48]. Gene models used for BLAST were 
obtained from the P. chrysosporium genome database 
[6,71]. In the case of multi-domain proteins, sequences en- 
coding CBM and its associated domains were separately 
used as separate queries. TBLASTN programs were per- 
formed with an expectation value of l.OE'^, and all other 
settings at default values. 

Sugar transporters 

Protein models that were annotated as predicted sugar 
transporters and/or permeases in the P. carnosa genome 
portal vl.O were used as queries for BLASTp against the 
NCBI protein database [55] and the P. chrysosporium 
genome portal version 2 to confirm these annotations. 
Annotations were considered accurate when either 
BLAST search gave an alignment to a predicted pro- 
tein with an E-value of < le-lO and a score of > 200. 
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The P. carnosa genome was then re-searched using the 
P. chrysosporium genes selected above as queries to con- 
firm that all of the gene models relevant to this analysis 
were selected. 

For the phylogenetic analysis, multiple alignment was 
performed using MAFFT version 6 software [72] with 
the E-INS-i algorithm. The phylogenetic tree was then 
constructed from the multiple alignment using the boot- 
strapped neighbor-joining method (1000 bootstraps), 
and drawn using FigTree version 1.3.1 [73]. This analysis 
included 28 gene models from P. carnosa and 25 gene 
models from P. chrysosporium, while partial gene frag- 
ments were removed. 

Oxidoreductases 

The BLASTp algorithm available through the JGI Fungal 
Genomics Program website [41] was used with default 
settings to search Agaricomycotina gene catalogue pro- 
teins against reference proteins. Hits were then blasted 
against the NCBI database [55] with default settings, and 
aligned to the reference protein sequences using the tool 
at the Genestream Bioinformatics Resource server [74]. 
Sequences were annotated to the reference protein when 
the bests hits to NCBI represented sequences of interest 
and the alignment showed at least 30% amino acid iden- 
tity to the reference protein. Reference proteins were 
chosen based on biochemical evidence supporting their 
identity, and correspond to the following Genbank ac- 
cession numbers: LOl (laccase) LAC2_PLEOS, L02 
(peroxidases) LIG8_PHACH, LOS (cellobiose dehydro- 
genase) CDH_PHACH, LDAl (aryl alcohol oxidase) 
AAC72747, LDA2 (vanillyl-alcohol oxidase) VAOX_- 
PENSI, LDA3 (glyoxal oxidase) AAA33747, LDA4 (pyra- 
nose oxidase) P20X_PHLGI, LDA5 (galactose oxidase) 
XP_959153, LDA6 (glucose oxidase) XP_002910108, 
LDA7 (benzoquinone reductase) AAD21025, LDA8 (al- 
cohol oxidase) AAB57849, methanol oxidase ALOX_PI- 
CAN, quinone reductase AF465406. 

P450 monooxygenases 

Initial determination of the putative cytochrome P450 
gene models in P. carnosa was made by searching the 
JGI whole genome database for the term 'P450'. The 
resulting putative sequences were subjected to BLAST 
analysis and searched for the presence of the conserved 
P450 signature domains namely, the oxygen-binding 
motif 'EXXR' and the heme-binding motif 'CXG'. P450s 
that showed both the domains were considered authen- 
tic and were grouped into families and subfamilies based 
on the existing nomenclature criteria of > 40% amino 
acid homology for assigning a family and > 55% for a 
subfamily. The families were then grouped into P450 
clans, a higher order level of nomenclature that repre- 
sents a cluster of P450 families across species, grouped 



based on relationships that are beyond the family desig- 
nations [57]. P450 superfamily nomenclature rules were 
followed for assigning the clan, family and sub-family 
classification as earlier applied for P450ome classifica- 
tion in the model white-rot fungus, P. chrysosporium 
[34]. P450s that did not have P. chrysosporium P450 
homologues were annotated based on the phylogenetic 
alignment with other P450s on the phylogenetic tree; the 
tree was constructed using Mega 4 via the bootstrap 
UPGMA method [75]. P450s showing both the con- 
served domains and a reasonable deduced protein length 
(> 330 aa) were used for the tree construction. 

Metabolic network reconstruction 

The P. carnosa metabolic network was reconstructed 
using version 15 of the Pathway Tools Software from 
SRI International Inc [76]. This network was primarily 
constructed from the annotation using Enzyme Com- 
mission, Gene Ontology identifiers as well as name 
matching algorithms. The reconstructed metabolic net- 
work contained 1166 metabolites, 1630 enzymatic reac- 
tions and 11 transport reactions that were linked to 
3292 enzymes. The completed network is available at 
[31]. 

Cultivation on model and wood substrates 

P. carnosa and P. chrysosporium were grown in duplicate 
on modified Kremer and Wood medium containing 
1.5% agar with 35 different model carbon sources [19] 
for 13 and 3 days, respectively, after which pictures of 
the plates were taken to compare colony diameter and 
thickness. Relative growth was determined by comparing 
the radius and density of the mycelia on a particular car- 
bon source to that on D-glucose; relative growth profiles 
were then compared between the two fungal species. 
The extent of growth relative to plates containing glu- 
cose are categorized from high to non-detectable using 
the following designations: +++, ++, +, ±, -. No differ- 
ences were observed between duplicates on any of the 
substrates, so only one of the pictures is presented. 
Monomeric and oligomeric carbon sources were used at 
a final concentration of 25 mM, while pure polymers 
were used at a final concentration of 1%. Crude plant 
biomass was used at a final concentration of 3%. This 
procedure allows consistent, qualitative assessment of 
colony development on many substrates; it has been per- 
formed routinely for more than 100 fungal species and 
showed good overall correlations with the genome con- 
tent of these species [16,18,19]. 

The following hardwood and softwood species were 
used for wood cultivations: sugar maple {Acer sac- 
charum), yellow birch (Betula alleghaniensis), trembling 
aspen {Populus tremuloides), red spruce [Picea rubens), 
white spruce (Picea glauca), balsam fir (Abies halsamea), 
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and red pine [Pinus resinosa). All wood samples were 
obtained from New Brunswick, Canada. A 50 cm bolt at 
80 cm and 130 cm trunk heights were cut from each 
wood species. Sapwood and heartwood sections were 
then separated, air-dried, and processed separately using 
a Wiley mill (Thomas scientific, NJ, USA). Resultant 
wood powder was sifted using 2 mm and 0.425 mm 
diameter mesh sieves and powder that passed through 
the 2 mm sieve but was retained by the 0.425 mm sieve 
was recovered. Four grams of wood powder were placed 
on top of 5 g vermiculite powder (< 1 mm dia.) in a glass 
petri dish measuring 9 cm in diameter; 20 mL of H2O 
was gently added to the dish, and the dish was then 
autoclaved for 20 min. A 0.5 cm dia. agar plug taken 
from the growing edge of P. carnosa or P. chrysosporium 
cultivated on YMPG agar plates was then transferred to 
the centre of each plate, and incubated at 27°C under 
stationary conditions. To maintain moisture content, 
5 mL H2O was added to each plate every week during 
cultivation. Fungal growth was quantified by measuring 
the diameter of the fungal colony growing on each wood 
powder. Three replicate cultivations were prepared for 
each fungal and wood species. 

Extractives analysis 

An accelerated solvent extraction method (DIONEX, Ap- 
plication Note 335) was used to isolate wood extractives. 
To obtain standard samples for baseline analysis, 2 g of 
non-treated heartwood and sapwood samples from each 
wood species were mixed with approximately 0.45 g of 
diatomaceous earth, and then transferred to an 1 1 mL cell; 
the headspace in the cell was then filled with sand. Extrac- 
tion was performed as follows; preheat 0 min, heat 5 min, 
static 5 min, flush 90%, purge 60 sec, cycles 5, pressure 
1000 PSI, temperature lOO'C, solvent 70% MeOH and 
30% H2O (vol/vol). To obtain extractives from fungal- 
treated and control samples, each sample was mixed with 
solvent (70% MeOH, 30% H2O (vol/vol)) in a 10:1 ratio 
(solvent(mL):sample weight (mg)) and incubated for 24 h 
on an orbital shaker at room temperature, and then fil- 
tered. Supernatant was collected and a second extraction 
was performed on the filtered wood samples. Supernatants 
from the first and second extractions were combined, con- 
centrated using a rotary evaporator, and then dried using a 
nitrogen evaporator. 

Extractives were analyzed using Ultra performance li- 
quid chromatography (UPLC). UPLC analysis was per- 
formed using a Waters Acquity Ultra Performance Liquid 
Chromatography equipped with a computer and Masslynx 
software, a binary solvent manager, a sample manager and 
an autoscan photodiode array spectrophotometer detector 
(PDA ek). The UPLC was equipped with an Acquity 
UPLC BEH C18, 1.7 ^m, (2.1 x 50 mm i.d.) reverse-phase 
analytical column from Waters housing a Van Guard BEH 



CIS, 1.7 |im pre-column. All samples were dissolved in 
70% acetonitrile: 30% H2O and diluted to a concentration 
of 10 mg/mL or to a minimum of 0.25 mL. Standards 
included gallic acid, methyl gallate, quercetin and rutin; 
0.5 i^L of each standard and 3 |iL of samples were analyzed 
using gradient elution as shown in Additional file 1: Table 
S18. Column and auto-sampler temperature were main- 
tained at 25°C. Two fixed detection wavelengths (280 nm 
and 350 nm) were used to monitor the eluting peaks. 
Resolved peaks were scanned by the photodiode array de- 
tector from 240 to 460 nm. 

The F-C reagent method was used to calculate total 
phenolic concentration [77] . A calibration curve was cre- 
ated using gallic acid at concentrations of 25 mg/L, 50 
mg/L, 100 mg/L, 250 mg/L, and 500 mg/L in Milli-Q 
water. Briefly, 20 |^L of sample, gallic acid standard or 
blank was transferred to a 2.0 mL cuvette; 1.58 mL of 
Milli-Q water and 100 |iL of F-C Reagent were then 
added, mixed and incubated for 5 min. Subsequently, 
300 |iL of sodium carbonate was added, mixed and incu- 
bated for 2 h at room temperature. Absorbance was 
measured at 765 nm using a Beckman 800 series 
spectrophotometer. 

FT-IR spectroscopy 

Heartwood and sapwood of trembling aspen and red 
pine samples were collected before and after cultivation 
with P. carnosa and P. chrysosporium, and powdered 
using the mini-beadbeater-16 (Biospec products, USA); 
corresponding uninoculated controls were similarly pro- 
cessed. Two milligrams of wood powder were mixed 
with KBr (200 mg) and the mixture was pelletized using 
a die (1.3 cm diameter) and a hydraulic press. A Bruker 
Tensor 27 FT-IR was used to record the absorbance be- 
tween 4000 and 400 cm '^ with a resolution of 4 cm'^. 
Spectra representing the average of 32 scans were cor- 
rected for atmospheric vapor compensation; baseline 
was corrected using the rubber band method (Opus soft- 
ware, V. 5.0). Spectra were normalized for unit- vector 
and mean-centred prior to the principal component ana- 
lysis (PCA) using Unscrambler v. 9.7 software. 

Additional files 

f ^ 

Additional file 1: Table SI. P. carnosa sequencing summary. This 
describes the summary of libraries that were constructed for genome 
sequencing. Table S2. P. carnosa assembly summary. This describes the 
summary of the genome sequence assembly. Table S3. Gene model 
support by different lines of evidence. Statistics of predicted gene 
models. Table S4. Summary of Pcarnosa annotations. This summarizes 
genome annotation according to various classifications. Table S5. Top 50 
PFAIVl domains in Phanerochaete genomes. List of top 50 PFAM domains 
annotated in Phanerochaete genomes. Table S8. Comparison of the 
number of CAZymes in wood decaying basidiomycotina. General 
comparison of CAZyme gene members in various basidiomycotina 
[5,6,12,17,78,79]. Table S9. CAZymes that showed <60% identity to Phchr 
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orthologs. List and gene expression data of CAZy members siiowing low 
sequence identity witli ortfiologs in P. chtysosporium [10]. Table Sll. 
Comparison of the number of FOLymes in P. carnosa and selected 
Agaricomycotina. General comparison of FOLy members between several 
Agricomycotina [78]. Table S12. Summary of oxidoreductases potentially 
involved in lignocellulose degradation by P. carnosa (Phaca) and P. 
chrysosporium (Phclir). Summary and comparison of specific 
oxidoreductase members between Phanerochaetes. Table S13. Tandem 
duplication of P450 genes in basidiomycete genomes. Summary table of 
P450 tandem duplication in known basidiomycete genomes [5,6,12,17,78- 
80]. Table S14. P450ome classification in P. carnosa and its membership 
comparison with P. hrysosporium. Summary table of P450 members in P. 
carnosa and P. chrysosporium. Table SI 5. P450s upregulated in wood 
degrading cultures. List of P450 members in P. carnosa that were 
upregulated during wood degrading cultivation [10]. Table SI 6. UPLC 
peaks corresponding to wood-derived phenolic compounds that were 
transformed by P. carnosa or P. chrysosporium. Peak annotation and 
analysis of UPLC chromatograms. Table SI 7. Assignment of FT-IR peaks 
decomposed by P. carnosa and P. chrysosporium. Summary of the peaks 
detected in FT-IR analysis (Figure S7) [81-83]. Table S18. Gradient 
method of UPLC elution. Summarizing elution method used in UPCL 
analyses. 

Additional file 2: Table S6. Genomic distribution of CAZymes and 
FOLymes on major scaffolds in P. carnosa. This table summarizes the 
distribution of CAZy and FOLy members located within the major 
scaffolds of the P. carnosa genome sequence. 

Additional file 3: Figure SI. Pictures of P. carnosa and P. chrysosporium 
grown on various carbon sources. P. carnosa grew significantly slower 
than P. chrysosporium; as a result, pictures were taken after 13 and 3 days 
of cultivation for P. carnosa and P. chrysosporium, respectively. 
Cultivations were performed in duplicate and no significant differences in 
colony diameter or thickness were observed between the duplicates on 
any of the carbon sources. Monomeric and oligomeric carbon sources 
were used at a final concentration of 25 mlvl, while pure polymers were 
used at a final concentration of 1%. Crude plant biomass was used at a 
final concentration of 3%. Relative growth was determined by comparing 
the radius and density of the mycelia on a particular carbon source to 
that on D-glucose. The extent of growth relative to plates containing 
glucose are summarized as follows, from high to non-detectable: +++, + 
+, +, +, -. This semi-quantitative, consistent assessment of both colony 
diameter and thickness allowed comparisons to include a broad range of 
substrates, including those that yield too little mycelia for accurate 
weight measurement or that could interfere with protein measurements 
or ergosterol production [84,85]. Complete growth profiles of P. carnosa 
and P. chrysosporium, and other fungi can be found at [19]. Figure S2. 
Phylogenetic tree of GH51 enzymes from P. carnosa and P. chrysosporium. 
Proteins are labeled with protein IDs from the JGl databases for 
Phanerochaete carnosa vl.O (Phaca) and Phanerochaete chrysosporium v2.0 
(Phchr). The sequences were aligned using MAFFT, and the tree was 
drawn by FigTree. In the heatmap bar, abbreviations are; Y, YIVIPG; F, 
balsam fir; P, lodgepole pine; S, white spruce; IVl, sugar maple [10]. Heat 
map represents the number of sequence reads per million kb as shown 
in this figure and described in [10], 0 (black) to 50,000 (pink). Figure S3. 
Phylogenetic tree of GH5 enzymes in P. carnosa and P. chrysosporium. 
The tree was generated as described in Figure S2. Of the 34 upregulated 
wood-degrading CAZymes, that were at least four times more abundant 
in P. carnosa grown on at least one wood substrate compared to 
nutrient medium, 5 (15%) were GH5 enzymes [10]. Figure S4. 
Phylogenetic tree of predicted sugar transporters and permeases from 
genomes of P. carnosa and P. chrysosporium. Proteins are labeled with 
protein IDs from the JGl database for P. carnosa vl.O (Phaca) and P. 
chrysosporium vl.O (Phchr). Protein sequences of transporters from other 
yeast and fungal species were used for pylogenetic comparison, 
including; An Aspergillus nidulans, Ao Aspergillus oryzae, Ca Candida 
albicans, CI Candida intermedia, Gz Gibberella zeae. Hp Hansenula 
polymorpha, Kl Kluyveromyces lactis. Lb Laccaria bicolor, Nc Neurospora 
crassa, Pp Postia placenta, Ps Pichia stipitis, Sc Saccharomyces cerevisiae, Sp 
Schizosaccharomyces pombe, Tm Tuber melanosporum, and Tr Trichoderma 
reesei. The GenBank accession numbers of corresponding sequences are: 
AnHypl (XP_682442.1), AnHyp2 (XP_660070.1), AnMstA (CAC80843), 



Ao_BAF58341.1 (BAF58341.1), CaHgtl (CAA76406), CaHgt4 (XP_7231 73), 
CaHgtll (XP_719597), CIGxfl (AJ937350), CIGxsl (AJ875406), GzHypl 
(FAA74528), HpGcrl (AAR88143), KIHgtl (XP_451484), KIRagl 
(XP_453656), KIRag4 (CAA75114), Lb_FDR07962 (FDR07962), NcHypl 
(XP_328858), NcHxt3 (CAD21508), NcNCU00801(FAA34565.1), 
NcNCU08114 (XP_963873.1), NcRco3 (CAF76420), Pp_l 15604 (EFD81359), 
Ps_ABN65648.2 (ABN65648.2), PsSutl (AAD00266), ScHxtl (M82963), 
ScHxt7 (NP_010629), ScSnf3 (P10870), SpGhtl (Q9P3U6), Tm-CAZ81962.1 
(CAZ81 962.1 ),TrHxtl (AAR23147), TrXltl (shown enlarged; AY81 8402), 
TrHxt2 (DQ852622; Ruohonen and (Vlargolles-Clark, unpublished). The 
sequences were aligned using MAFFT, and the tree was drawn by 
FigTree. In the heatmap bar, abbreviations are: Y YMPG, F balsam fir, P 
lodge pole pine, S white spruce, M sugar maple [10]. Group 1 contains a 
predicted monosaccharide transporter (ID 100265) and hypothetical 
proteins with high similarity to known sugar transporters. Group IV also 
contains known glucose transporters found in yeast species and 
cellulolytic fungi, including Trichoderma reesei and Neurospora crassa. 
Group V consists of high affinity glucose transporters in yeast and hexose 
transporters in fungal species, including the xylose transporter found in T. 
reesei [86]. Group VI and VII contains predicted sugar transporters and 
putative sucrose transporters (ID 89844 and 254080). Group Vlll consists 
of predicted cellobiose transporters found in yeasts and filamentous 
fungi, and biochemically characterized cellodextrin transporters from N. 
crassa (NcNCUOOSOl and 08114) [30]. Figure S5. Phytogeny, genome 
position, and intron distribution of genes encoding manganese 
peroxidases and lignin peroxidases. Protein IDs of manganese 
peroxidases (A) and lignin peroxidases (B) of P. carnosa and P. 
chrysosporium are obtained from P. carnosa vl.O and P. chrysosporium 
v2.0. Alternative names are from MacDonald ef al. [10] for P. carnosa and 
from Vanden Wymelenberg et al. [6] for P. chrysosporium. Figure S5. 
IVlycelial growth of P. carnosa and P. chrysosporium on heartwood and 
sapwood samples isolated from different hardwood and softwood 
species. Colony diameter was measured for P. carnosa grown on 
heartwood (A) and sapwood (B), and for P. chrysosporium grown on 
heartwood (C) and sapwood (D). Filled square (red), sugar maple; filled 
circle (green), yellow birch; filled triangle (blue), trembling aspen; filled 
diamond (pink), red spruce; open square (purple), white spruce; open 
circle (orange), balsam fir; open triangle (gray), red pine. Error bars show 
the standard deviation in biological triplicates. Since P. chrysosporium 
mycelia was no longer visible after day 8 of cultivation on heartwood of 
sugar maple, yellow birch, white spruce and balsam fir, those data were 
not obtained. Figure S7. Different modes of wood decay described by 
FT-IR analysis. (A) Grouping on Principal Components (PCs) 1 and 2 for 
normalized FT-IR data obtained from heartwood and sapwood samples 
of trembling aspen and red pine after cultivation of P. carnosa and P. 
chrysosporium. Circle; P. carnosa, triangle; P. chrysosporium, square; control 
(untreated wood samples). (B) PC loadings that distinguish wood samples 
treated with P. carnosa from corresponding control samples. For 
example, high positive loadings describe components in control samples 
that were lost in the decayed samples. Loadings for PCI are shown for 
trembling aspen heartwood and red pine sapwood, while loadings for 
PC2 are shown for trembling aspen sapwood and red pine heartwood. 
(C) PC loadings that distinguish wood treated with P. carnosa from 
corresponding wood samples treated with P. chrysosporium. Horizontal 
dotted lines at magnitude |0.05| represent thresholds for loading 
significance. Corresponding wavenumbers (cm"^) are indicated for peaks 
with significant loadings; identities of significant wavenumbers are 
summarized in Additional file: Table. SI 7. Percent values given in y-axes 
denote the percent of total sample variance described by the PC. 

Additional file 4: Table S7. List of CAZy members computationally 
annotated in P. carnosa. List of protein IDs computationally annotated as 
CAZy members in P. carnosa. 

Additional file 5: Table S10. List of the genes involved In carbohydrate 
metabolism. List of P. carnosa and P. chrysosporium genes involved in 
carbohydrate metabolism accompanied with gene expression data of P. 
carnosa. 
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